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We study the convex hull of the set of points visited by a two-dimensional random walker of T 
discrete time steps. Two natural observables that characterize the convex hull in two dimensions 
are its perimeter L and area A. While the mean perimeter (L) and the mean area (A) have been 
studied before, analytically and numerically, and exact results are known for large T (Brownian 
motion limit), little is known about the full distributions P{A) and P{L). In this paper, we provide 
numerical results for these distributions. We use a sophisticated large-deviation approach that 
allows us to study the distributions over a larger range of the support, where the probabilities P{A) 
and P{L) are as small as 10“®°°. We analyse (open) random walks as well as (closed) Brownian 
bridges on the two-dimensional discrete grid as well as in the two-dimensional plane. The resulting 
distributions exhibit, for large T, a universal scaling behavior (independent of the details of the 
jump distributions) as a function oi A/T and L/y/P, respecively. We are also able to obtain the 
rate function, describing rare events at the tails of these distributions, via a numerical extrapolation 
scheme and find a linear and square dependence as a function of the rescaled perimeter and the 
rescaled area, respectively. 
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I. INTRODUCTION 

Random walks, originally introduced in 1921 by G. 
Polya [T], have since been a vital field of research. They 
are ubiquitous models for physical, biological and so¬ 
cial processes [HU. Example applications from biology 
include self-propelled motion of bacteria and the diffu¬ 
sion of nutrients [5], as well as animal motion in general 
011] or during the marking of territories or description of 
home ranges [US]. For the latter case a strong increase 
of the amount of experimentally available data ocurred 
after the introduction of automated radio/GPS tagging 
of animals [101 lllj . The usage of minimum convex poly¬ 
gons, called convex hulls, bordering the trace of an ani¬ 
mal 0(11] is a simple yet versatile [13] way to describe 
the home range and can be used for any type of (random- 
walk) data. In two dimensions, the convex hull of a point 
set is the minimum subset whose elements form a convex 
polygon in such a way that (a) all points of the set and 
(b) the connecting lines between all possible pairs lie in¬ 
side the polygon. The convex hull is a suitable measure 
for time-discretized random walks because it can easily 
be attributed with geometrical quantities, i.e., hull area 
A and perimeter L. Also, for numerical calculations, sim¬ 
ple and already quite fast algorithms with running times 
0{N ■ log TV) exist [TIHTB] . 
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On the analytical side, much progress have been made 
for asymptotic results at long times, when a random walk 
(with a finite variance for step sizes) converges to the 
continuous-time Brownian motion (for a recent review 
see Ref. m)- For a single Brownian motion of length T 
in two dimensions, the mean perimeter [mds] and the 
mean area [30] were computed long back. 

Recently, adapting Gauchy’s formula [21] for convex 
curves in two dimensions to random curves, it was 
shown that the problem of computing the mean 

perimeter and the mean area of the convex hull of an arbi¬ 
trary two dimensional stochastic process can be mapped 
to computing the extremal statistics of the one dimen¬ 
sional component of the process. This procedure was suc¬ 
cessfully applied recently to compute the mean perimeter 
and the mean area of several two dimensional stochas¬ 
tic processes such as N independent Brownian motions 
in 2-d [TT] IMj . random acceleration process in 2-d [25) . 
2-d branching Brownian motions with absorption with 
applications to edpidemic outbreak |26] and 2-d anoma¬ 
lous diffusion processes m- Very recently, this method 
was also successfully used to compute the exact mean 
perimeter of the convex hull of a planar Brownian mo¬ 
tion confined to a half-space [28]. Finally, using different 
methods, the mean perimeter and the mean area of the 
convex hull of a single Brownian motion, but in arbitrary 
dimensions, have been computed recently in the mathe¬ 
matics literature [29l[30]. 

The question naturally arises regarding the higher mo¬ 
ments or even the full distribution of the perimeter and 
the area of the convex hulls of two dimensional random 
walks. Computing analytically even the second moment 
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(and hence the variance), for the perimeter and the area 
of the convex hull of a single two dimensional Brownian 
motion, turns out to be highly difficult [551 US] ■ Since 
so far no analytical results are available concerning the 
full distributions, performing numerical studies is a nat¬ 
ural approach. Within a straightforward implementa¬ 
tion (“simple sampling”), one generates many times in¬ 
dependently a random walk of T steps and constructss 
the hull polygon and, subsequently computes its area A 
and perimeter L. Then one records histograms of these 
quantities. Nevertheless, this approach only gives insight 
into a small portion of the actual distributions P(A) and 
P{L), i.e., configurations with P oc K~^, the inverse of 
the number K of samples. This demands for specihc cal¬ 
culations with respect to the large-deviation properties 
of this model. So, we use a particular large-deviation 
approach (STJ [35] which randomly alters steps of the ran¬ 
dom walk, compares the effected change in terms of either 
of the quantities and accepts the alteration according a 
Metropolis criterion involving an artifical Monte Carlo 
“temperature” 0. This is performed for various values 
of 0, giving access to different ranges of the distribu¬ 
tion under scrutiny. Afterwards, the resulting distribu¬ 
tions Pe^A) or Pq{L) can be merged according to a given 
scheme. In the end, we are able to obtain distributions 
P{A) and P{L) which cover a range in principle only 
limited by the numerical precision of our computers. 

The rest of the paper is organized as follows: Sec. |ll] 
will introduce the used random walk model, the associ¬ 
ated convex hull and then will shortly allude to the algo¬ 
rithms and some heuristics. Following this, in Sec. |III| we 
will introduce the large-deviation method. Our results 
are presented in Sec. |IV[ and we will finally conclude in 
Sec. |V| giving an outlook to possible future work. 

II. RANDOM WALKS, CONVEX HULLS AND 
ALGORITHMS 

A time-discretized random walk consists of T step vec¬ 
tors 5i, and the position x{t) at timestep t < T is the 
sum of all steps up to r, i.e.: 

r 

x(T)=fo+^<^i (1) 

i=l 

The walk conhguration itself is then the set W = 
{di, di,..., dr} of steps [3Sj. The step Si = {Sx,i,Sy^i) 
itself denotes a displacement of the particle by dx,i in 
x-direction and Sy^i in y-direction. We consider two dif¬ 
ferent types of steps: 

1. A time-discrete approximation to a Brownian walk, 
i.e., both Sx^i and Sy^i are, for each i, drawn ran¬ 
domly from a Gaussian distribution with zero mean 
and variance one. 

2. A walk with discrete steps corresponding to mo¬ 
tion on a square lattice of spacing J. This 


corresponds to the four possible steps x G 
{(0, J), (0, — J), (—J, 0), (J, 0)}, with each of these 
possible up/down/left/right steps having probabil¬ 
ity i- 

Note that in the limit of long walks T —>■ oo Gaussian 
walks and discrete lattice walks should statisically agree 
when setting J = \f2. We also consider closed random 
walks which return to the origin, i.e., a:(T) = a:(0): 

1. For the lattice case, we generate only the first half 
of the walk randomly. The second half is filled up 
with inverse steps dj_|_T = —Si. Finally the order 
of the steps is randomized via swapping randomly 
selected pairs of steps. 

2. For the Gaussian case, we consider Brownian 
bridges [34], i.e., from the walk x{t) we construct 

Xbij) = x{t) - ^x{T) (2) 

which fulfill Xb(T) = a:(0) and have all necessary 
statistical properties of random walks. 

The convex hull C = conv(7^) of a two-dimensional 
point set V = {Pi}, Pi G is described through a con¬ 
vex set over V. The points P within C are given by 
all possible combinations P = oaPi with Pi G V and 

ai = 1 and ai G K)}' (definition given according to 
[35] 1. This means: 

1. All points Pi GV lie within C. 

2. All lines PiPf, Pi, Pj G V also lie within C. 

The boundary of the convex set is a polygon which 
connects a subset V C V oi H points from the point 
set, i.e., V = {Po,Pi, ■ ■ ■,Ph-i}, with Pi = {xi,yi) {i = 
0,..., H — 1). The hull is attributed with area A and 
perimeter L according to (identifying i = H with f = 0): 

1 

MC) = ^'^{yx + Vi+i) ■ (x^-Xi+i) (3) 

H-1 

L{C) = ^ V {xi - a:*+i)2 -b (y* - y*+i)2 (4) 

i=0 

For our work, we determined the polygons bordering 
convex hulls (for which one uses shortly the term “con¬ 
vex hull”) numerically. The most common convex hull 
algorithms operate in 0{N\ogN). We used Andrew’s 
variant [16] of the “Graham Scan” algorithm [14], which 
constructs the convex hull by first drawing a dividing line 
through the point set and then by sorting out those points 
which don’t form monotone chain of clockwise/counter¬ 
clockwise turns on each side of the line. In usual cases, 
the application of convex hull algorithms can be accel¬ 
erated by usage of pre-selection heuristics, such as the 
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one introduced by Akl and Toussaint [36] . This heuristic 
looks up extreme points of the set (i.e., those of maxi¬ 
mum and minimum x- and y-coordinates) and discards 
all points which lie inside the quadrilateral formed by 
these points. We use a custom refinement of this heuris¬ 
tic, which is based on iterating the heuristic under rota¬ 
tion of the coordinate origin, which eliminates another 
fraction of inert points per each iteration. 

III. LARGE-DEVATION SCHEME 

For simple-sampling results, walk configurations W are 
generated randomly, and the according convex hulls C 
are calculated through the algorithm, resulting in a mul¬ 
titude values of A and L. As mentioned above, obtaining 
histograms of these values only gives access to the high 
probability regime, where the convex- hull-properties of 
typical random walks are measured. However, in order 
to obtain values of these quantities with especially low 
probabilities, allowing us the measure the distributions 
P{A) and P{L) over a large range of the supoort, a cer¬ 
tain Markov-Chain Monte Carlo (MCMC) scheme can be 
used [SI Eg. 

The MCMC consist of an evolution of random walks 
t being another discrete “time” parameter, not to 
be confused with the time parameter r of the random 
walks. For the walks, we measure the property S{t)^ 
i.e., the area {S = A) or perimeter {S = L), depend¬ 
ing on which distribution we are aiming at. The initial 
configuration >V(0) is any walk configuration, e.g., a ran¬ 
domly chosen one. At each Monte Carlo step t, the walk 
W(t) is altered to W* by replacing a randomly selected 
step 6i {i S {1,2,...,T}) with a newly generated step 
(5'. The new step is generated according the same dis¬ 
tribution as all random walks steps of the corresponding 
type, e.g., Gaussian. The convex hull of W* is calculated, 
leading to quantitiy S*. The alteration W* is accepted 
(yy(t-l-l) = W*) according to the Metropolis probability: 



Here, 0 is the (artificial) Monte Carlo “temperature”, 
which is just a parameter used to set the range of the 
sampled values. If the alteration is not accepted, it is 
rejected, i.e., >V(t -I- 1) = VV’(t). Now, over the course of 
the MC steps, A or L develops according to the effected 
changes and 0. Examples are shown in Fig. As usual, 
the MCMC time is measured in terms of sweeps, i.e., 
number of steps per system size, which is here the walk 
length T. 

Like in any MCMC simulation one must equilibrate the 
simulation, i.e., discart the initial part of the measured 
quantities until “typical” values are found. Equilibra¬ 
tion was found to be difficult but possible for simulations 
regarding hull area A. This was in particular difficult 
for Gaussian-distributed Si components, probably due to 



FIG. 1. (color online) Different renditions of closed walks of 
length T = 30 after Tmc = 3T0® Monte Carlo steps, i.e., Ts = 
10^ sweeps, at the given temperatures. The top row features 
lattice steps, while the bottom row shows Gaussian steps, plus 
the according convex hulls (indicated by the dashed lines). 
The walks to the left are open and those to the right closed. 
Higher temperatures inhibit the acceptance of step alterations 
which lower the hull area A, thus the walks and their hulls 
experience drastic growth. 

the fact that there exists no upper limit for A and L, in 
contrast to the discrete case. As demonstrated in Fig. 

0 = —16.25 it still took several tg oc 10® MC sweeps un¬ 
til equilibration. Opposed to this, simulations regarding 
perimeter L equilibrated rather quickly, typically within 
less than ts « 10® sweeps. This differing behavior seems 
to result from the fact that the replacement of one single 
step Si may affect much larger changes in A than in L, 
thereby making the whole model much more sensitive, 
especially for small value of T. 

For a given walk type (discrete/Gaussian, 
open/closed), a given walk length T and a given 
quanity (A or L) we performed simulations for different 
values of 0 = {0i,..., 0if}. The numer K of tem¬ 
peratures and the actual values, posive and negatives 
ones, depend heavily on the model, the walk length and 
the measured quantity, see below how the temperatures 
are chosen. Thus, we got different distributions Pq{S) 
which are related with the actual distribution P{S) 
according to the following relation m 

P{S) = e^/^ ■ Z{Q) ■ Pe{S) 


( 6 ) 
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FIG. 2. Equilibration of the hull area A of open Gaussian 
walks over the number of Monte Garlo steps, normalized to 
G by walk length T = 50. For 0 = —16.25, the model is 
fully equilibrated after ca. ts = 5 - 10"^ sweeps, and heedless of 
the differing initial configurations (see inset), i.e., A behaves 
similarly within the range of fluctuations. 


Note that “distribution” here either means “probabil¬ 
ity” or “density”, depending on whether the possible val¬ 
ues of S are discrete or continuous. For different val¬ 
ues of 0, different ranges of the measured value S were 
obtained. This allows for a picewise reconstruction of 
P{S). It only requires knowledge of the normalization 
constants Z{Q). They can be calculated through inver¬ 
sion of this formula whenever for two values 0i and 02 
the ranges of the sampled values of S overlap. Thus, 
the temperatures were chosen such that for neighbouring 
temperatures the measured histograms sufficiently over¬ 
lap. In principle, for each value of S where the mea¬ 
sured values Pe^{S) > 0 and Pe^iS) > 0 one estimate 
of Z(0i)/Z(02) is obtained. Note that in case of equli- 
bration, this ratio is more or less constant for all values 
of S where the two histograms overlap. One the other 
hand, for non-equilibrated cases a systematic dependence 
on S is seen. In this way we have another convenient 
criterion to verify equilibration. Via these pairwise com¬ 
parisons, all ratios Z{Qk)/Z{Qk+i) can be determined. 
Finally, the overall determination of the normalization 
constants is obtained from the global normalization con¬ 
straint J2s = 1 (or / P{S) dS = 1 for the continu¬ 
ous case). 


IV. RESULTS 
A. Independent Points 

To verify our simulations, we first simulated convex 
hulls of n independently distributed points in the unit 
square [0,1]^. For this case some mathematical results 
are known |37H39j . In particular the remaining area 
An = I — A{n) outside the convex hull is considered. 
The distribution of the rescaled remaining area 

{An - 46„)/2c„ , (7) 

where bn = and c„ = should converge 

for n —>■ oo in distribution to the standard normal distri¬ 
bution A/’(0,1). 



FIG. 3. Average /r and variance cr^ (inset) of the remaining 
area An of a unit square in presence of a convex hull of n ran¬ 
domly distributed points. The symbols show the numerical 
results while the lines show 46„ and 2c„, respectively. 

Thus, for the unsealed data A{n)^ the mean ^ and the 
standard deviation a should follow 46„ and 2c„, respec¬ 
tively. This is the case for our simulations, as visible in 
Fig.i where we indeed find a 46„ behavior for the mean 
and for larger sizes a 2c„ behavior for the variance. 

In a similar way, we attempted to visualize the pre¬ 
dicted convergence of the distributions towards .^0, 1) 
by plotting P(A„) • cr over (Jl„ — /r)/cr, cf. Fig. As 
visible, the measured density is very close to V(0,1) but 
no clear convergence is visible. Probably this would be 
visible only for much larger number n of points, thus 
the convergence is very slow. Note that these deviations 
become more pronounced in the small-probability tails, 
which we also obtained wihthin the MCMC approach for 
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selected numbers n (not shown here). Anyway, for the 
purpose of verifying our simulations, the agreement with 
the predicitions is sufficient. 



FIG. 4. As predicted by Cabo & Groeneboom |37], the distri¬ 
butions P(An) of the remaining area not covered by the con¬ 
vex hull over n random points inside the unit square should 
converge towards 1) when being rescaled according to 

Eq.[7l 


B. Random Walks 


For each type of walk and either of the two quantities 
A and L, we considered walks lengths T G [30, 2000] with 
ca. K = 10® samples per temperature 0. Through sam¬ 
pling multiple values of A or L per MC run (i.e., each 
tg = 10 sweeps), it was possible to obtain the results for 
each case within several days of CPU time on a average- 
size multi-core cluster (using up to few dozens of cores 
per case, corresponding to the number of temperatures 
0 ). 

An example of the resulting distributions, P(A) for the 
case of Gaussian open walks of various lengths, is shown 
in Fig. Evidently, we were able to obtain the distribu¬ 
tions over a large range of the support, with probability 
densities as small as 10“®®®. 

As the systematic behavior of the distributions P(A) 
for different walk lengths T hints a common origin at 
a distribution P(A) (cf. Fig. [^, the first property we 
put under scrutiny is the scaling behavior. The scaling 
behavior m of the mean (A) ^ T suggests that distri¬ 
butions P{A), which in fact depends on the walk length 
T, scales according to the following relation: 


p{A)=^-p(^^y ( 8 ) 

where P is a T-independent distribution. The application 
of the scaling assumption leads to a reliable collapse of 
the distributions P(A) towards P (y) with the exception 
of finite-size effects within the tail, as shown in the middle 
of Fig. [5] 

For perimeter distributions, we use a similar scaling 
assumption, which is based upon (L) oc a/T [T7] : 


The application of this relation to open walks, shown in 
Fig.§ reveals an almost perfect collapse even for large 
values of T and the latter parts of the rare-event tail. 
We found similar scaling (not shown) for the discrete 
cases. Opposed to this, closed walks seem to be more 
strongly affected by finite-size effects, i.e., to tend to¬ 
wards P rather slowly. Nevertheless, in summary 

for all 8 cases the corresponding scaling is supported by 
our data. 

Next, to extrapolate the leading behavior of the distri¬ 
bution in the large-deviation tail, we calculate the empir¬ 
ical rate function $(s) [30] and guard its behavior over 
walk length T. Under the assumtition that away from 
the maximum the the leading behavior of the probabil¬ 
ity is exponentially small in T, i.e., the rate function is 
defined as: 


$(s) = -;^logP(s). 


( 10 ) 


Distributions where the rate function is well defined, i.e., 
follow Eq. (101 are said to obey a “large-deviation prin¬ 
ciple” . To allow a comparison and extrapolation of the 
rate function, s is usually a quantity normalized with the 
maximum possible value, such that s G [0,1]. Thus, one 
would define sa = and sl = t ^ , accordingly. 

Eor the lattice walk model with J = -\/2, these maxi- 
mum possible values were found to be Amax = 


Tmax = 2 • • T for open walks as well as A^ax = \- 

and Pmax = • T for closed walks. But for Gaussian 

walks no actual “maximum” values of A and L exist, so 
in remembrance of the lattice cases to obtain the same 
scaling, we choose sa = ^ or sl = ^, respectively. 

An example of the resulting rate functions, for the case 
of the perimeter of the hull of open Gaussian walks, is 
shown in Eig. Apparently, for large values of s^, the 
rate functions for different walks lengths agree very well, 
while for small values of sl strong finite length effects 
are visible. Anyway, from the visual impression, the rate 
function seems to converge for P —>■ oo pointwise to a lim¬ 
iting function, which starts at the origin. Thus, its seems 
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FIG. 5. Distributions P{A) and P{L) of area A and perimeter L of the convex hull over Gaussian walks of different lengths 
T. (Left) “original” distribution P{A) for selected small values of the walk length T. Main: plot log scale, inset: peak region 
in linear scale. (Middle) rescaling of P{A) according to Eq. (|^. Main plot: log scale; upper right inset: peak region in linear 
scale; lower left inset: fit of Gumbel distribution to T = 1500 data (Right) rescaling of P{A) according to Eq. Q. Main plot: 
log scale; lower left inset: peak region in linear scale; upper right inset: fit of T = 1400 data to “modified” Gumbel distribution. 



FIG. 6. The rate functions ^{sl) of open Gaussian walks for 
different walk lengths (lines). The symbols show the extrap¬ 
olated values (4>oo(s) see text and Fig. [^. The inset shows 
the same on normal scale. 

that the distribution obey the large-deviation principle. 
To quantify this, we performed fits of the form 

$(s,T) = $oo(s)-^^-^ (11) 

for many values of s, as it is demonstrated in Fig. 

We obtained similar results for all other walk types and 
measured quantities. We found that $oo(s) appears to 
grow according to power laws . (The parameters ^ and 
7 are not of interest and show no systematic behavior 
with s). The results are listed in Tab. |T] The values for 
the area are very close to k = 1 and for the perimeter 
very close to k = 2, in particular for the Gaussian walks. 

This can be already seen when one combines the rate 
function for the scaled variables with the scaling forms 



FIG. 7. Dependence of rate function 4 >(sl) (for open Gaussian 
walks) on walk length T at selected values s = 1.2 and s = 0.5 
(inset). A power law was used to determined the T —>■ oo 
convergence towards an asymptotic value <E>oo, indicated by 
horizontal lines. 

Eqs. § and 0- By equating the two expressions for 
the distribution of the area we obtain 

^-ha/tat ^ lp ( A / r ). 

To make the argument of the exponential a function of 
A/T (the factor l/T is of lower order and can be ignored) 
$(s) ^ s must hold, i.e., = 1. Correspondingly for 

the perimeter we get 

e-^(L/TyT ^ ^ 
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Model 

KA, PiA) 

KL, P(T) 

Gaussian, open 

0.999(1) 

2.03(2) 

Gaussian, closed 

1.06(2) 

2.06(1) 

lattice, open 

1.18(1) 

2.13(1) 

lattice, closed 

1.12(2) 

2.12(1) 


TABLE 1. Resulting exponents k which deterine the power- 
law growth of the asymptotic value <E>oo (s) of the rate function 
<l?(s) with sa or sl, respectively. 


which results in ~ i.e., indeed P{L/Vt) ~ 

e-(LVT^)T = ^ 2 . 

The fact that for Gaussian walks, the exponents n 
agree very well with = 1 and = 2 but not quite 
for the lattice walks is probably due to the discrete struc¬ 
ture of the lattice and due to the limited accessible area. 
These have a strong influence for the given limited length 
T of the walks and influence the rate function in partic¬ 
ular for small and large values of s. In fact the rate 
functions for the lattice case are continuously bending 
up in a log-log plot (not shown), such that one easily 
can find a short interval where the expected exponent n 
appears. Thus, to conclude this part, the data is well de¬ 
scribed by the rate function, i.e., the distribution obeys 
the above large-deviation principle. Hence, in connection 
with the fulfilled scaling forms Eqs. ([^ and Q, we see 
that to leading order and asymtotically the distributions 
are given by P{A) ^ and P{L) ^ e~^ 

The Gaussian and the exponential tails, respectively 
for the distributions of the perimeter and the area, may 
be guessed using very simple heuristic arguments. In¬ 
deed, the perimeter L of the convex hull of a two di¬ 
mensional stochastic process morally scales as s, where 
s represents the span of the one dimensional component 
process. For the 2-d Brownian motion, s is simply the 
span of a one dimensional Brownian motion of duration 
T. The probability distribution P{s,T) of the 1-d Brow¬ 
nian motion of duration T and diffusion constant D is 
well known [H |TT] and has a scaling form 

where the scaling function f{x) is given exactly by 

oo 

fix) = ^ E ■ (13) 


The scaling function has the following asymptotic behav¬ 
ior 


f{x) 


2'n^ X X —)■ 0 

e X ^ oo 

VTT 


(14) 


and is plotted in Fig|^ 

Thus, for large s, P{s,T) ^ exp [—s^/ 4IIT] has a 
Gaussian tail. Gonsequently since L ^ one would ex¬ 
pect a Gaussian tail for the distribution T’(T). Note that 



FIG. 8. The scaling function f{x) in Eq. (131 for the span 
distribution for a 1-d Brownian motion, plotted as a function 
of X. 


for small S (and hence for small L), one would expect an 
essential singular behavior P{L) ^ exp[—6/L^] from Eq. 
(141, where a is some constant. The scaling function fix) 


in Eq. (13), and hence P(T) is almost flat near L = 0, 


and falls off as a Gaussian for large L with a peak in 
between, around which is has an approximate Gaussian 
shape (see Fig. 


P(L) = 






(15) 


Subsequently, for the area we can approximately as¬ 
sume A (X and thus the distribution of y/A should 
also be Gaussian, including a factor XjyfA originating 
from dLjdA ~ XjyfA : 


(16) 

Had the assumption of a Gaussian distribution for the 
perimeter been entirely correct, it would be sufficient to 
take the average and the variance and plot the resulting 
Gaussian together with the data. First, we concentrate 
on the case of the hull perimeter for closed planar Brown¬ 
ian motions, where mean and variance are available ana¬ 
lytically [23] . The mean scaled perimeter is given by [23] 


PiA) = 




A 


L 


Vt 


tt^T 1 


^ =y —■^ = y (17) 


Vt 


where E stands for the expectation value. For the 
variance, we use Goldman’s result for the second mo¬ 
ment 1231. 


EiiL/VT)V = \i7T 


smu 


du — I 


(18) 



























The value of the integral of dusin{u)/u is approxi¬ 
mately 1.852, known as the Wilbraham-Gibbs constant. 
This then leads to the variance: 

0-2 = E{{L/VTf) - E{L/Vt)^ 

3 3 

= ’^(tt • 1.852 -!)-’!-« 0.348 

Before comparing the actual distribution, we compare 
average and variance obtained from our numerical re¬ 
sults. By fitting power laws plus a constant, similar 
to <0- we found /i to converge asymptotically towards 
fiao = 3.937(1), while tr^ fluctuated slightly around an 
average of cr^ = 0.347(1), see Fig. Both values agree 
with the aforementioned predictions. The results of the 
extrapolations for all considered cases of the different 
walk models are listed in Tab. |llj and it can be seen 
that the asymptotic values of the parameters of the dis¬ 
tributions agree rather well between lattice and Gaussian 
walks, as expected. Note that for the fitting the results 
the stated error bars are only of statistical nature, thus 
do not include systematic contributions due to the un- 
kown scaling behavior. Hence, the real error bars should 
be considerably larger. Thus the results for the averages 
/ioo can be considered to match well the corresponding 
calculations in Refs.[T7l[24j which read: 


TTl TT 

(Hop) = — ^oo = 2 « 1-571 (19) 

7tT TT 

(Hci) = — /Too = 3 « 1.047 (20) 

(Lop) = Moo = « 5.013 (21) 


Model 

Gaussian 

Gaussian 

lattice 

lattice 


Open 

closed 

Open 

closed 

Area distributions P{A) 


1.585(1) 

1.049(1) 

1.577(1) 

1.049(1) 

'2. 

^oo 

0.312(1) 

0.088(1) 

0.309(1) 

0.089(1) 

Perimeter distributions P(L\ 



5.023(2) 

3.937(1) 

5.015(1) 

3.938(1) 

^oo 

1.075(1) 

0.347(1) 

1.075(1) 

0.348(1) 


TABLE II. Resulting asymptotic values of average /i and 
variance of rescaled hull area A or perimeter L of the 
mentioned walk models. These values have been obtained 
by fitting to the results of and cr^(r) with growing 

walk length, hereby extrapolating the T —>■ oo case. For 
equal walk topologies (open/closed walks) and quantities 
(area/perimeter), the asymptotic values agree heedless of the 
actual step type, i.e., the walk geometry. 

However, as demonstrated in Fig. when actually 
plotting the numerical data together with a Gaussian 
parametrized by the analytic values, one sees that distri¬ 
bution only matches the numerics in the main region of 
the measured distributions L’(L), while the tail exhibits 
a different behaviour. On the other hand it is indeed 


possible to fit Eq. (151 to the tail of P{L) with good pre¬ 
cision. This means the shape of the distribution is Gaus¬ 
sian there. Nevertheless, the resulting value of « 1.35 
is considerably larger than the aforementioned, and the 
corresponding value of /x « 0.1 for these fits seems use¬ 
less. In the same way, for the other cases of perimeter 
distributions (not shown), one can fit Gaussians well ei¬ 
ther to the main region of the data or to the tails. But it 
is not possible to fit the data using one single Gaussian 
over the full support. Thus, the approximations which 
lead to the assumption of a Gaussian distribution were a 
bit too strong and the true distributions seem to be very 
Gaussian-like, but slightly different. 



FIG. 9. Average (rescaled) perimeter ^ and variance for 
closed Gaussian as a function of the walk length T. A conver¬ 
gence to the analytical expectations fi « 3.937 and ~ 0.348 
is visible. 


Regarding the area distribution P{A), the mentioned 
propierties are demonstrated in Fig. 0 Given the pre¬ 
vious results for the perimeter, we here just performed 
fits in a similar way. A Gaussian of the form of Eq. (16) 
can indeed be employed to fit the area distribution either 
in the high-probability region or everywhere away from 
the central region of the distribution, but again not over 
the full support. Thus, the same concusion as for the 
perimeter distribution holds. 

Although the focus of our simulations was on the right 
tails of the distributions, i.e., the large range of sup¬ 
port for larger than typical values, we have also per¬ 
formed some simulations to study very small values of 
the perimeter, to verify scaling function (14) leading to 
an expected ^ /t) bei^avior. 

The main plot of Fig. 12 shows, as example, the re¬ 
sulting rescaled distribution for open Gaussian walks of 
length T = 800 together with a fit to this functional form. 
Apparently, the data follows the predicted form very well 
in the range of small perimeters. We have furthermore 
studied the asymptotic behavior of the factor b in the ex¬ 
ponent and found (see inset of Fig. 12) that the behavior 
is compatible with a convergence to a limiting value. 
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FIG. 10. Scaled distributions of the perimeter L. While the 
scaling assumption collapses the distributions P{L) onto simi¬ 
lar behavior within the tail as well as in the main region (large 
inset), the Gaussian distribution according to Eq. ( |15[ ) (small 
inset) with the values of /i and taken from the analytics 
suits only in the main region. 



FIG. 11. Fit of a Gaussian according to Eq. (161 to P(A) 
with a focus on either the tail (large plot) or the region around 
the average (inset). It was not possible to fit one single Gaus¬ 
sian over the full support. 


FIG. 12. (color online) Main plot: rescaled distribution (sym¬ 
bols) of the perimeter for the open Gaussian walks for small 
values of the perimeter, length T = 800 together with a fit 
to the function Inset: Scaling behavior of the 

parameter h as function of walk length, for open and closed 
Gaussian walks, respectively. The lines represent fits to func¬ 
tions of the form + dP, resulting in baa = 129(9) (open) 
and boo = 250(50) (closed), with a very slow convergence 
(r? = -0.21(3) and rj = -0.09(2)). 


This leads to the natural assumtion that the perimeter 
L and the square root \/A of the area are Gaussian dis¬ 
tributed. Indeed either peak region of the data or the 
tails region fit this form very well, but it is not possible 
to fit over the full range of the support one single func¬ 
tion. Nevertheless, for almost all our results we found 
asymptotic agreement between the Gaussian and the lat¬ 
tice walk case, as expected. 

Since the application of large-deviation simulation ap¬ 
proaches for this problem turned out to be very useful, it 
should be applied for similar problems. For future work, 
besides obvious extensions like considering higher dimen¬ 
sions, we want to consider other random walk models, 
particularly multiple interacting walkers [421143j , in sim¬ 
ple self-avoiding walks as well as in loop-erased random 
walks [111 05] and, most importantly, for applications to 
biology like the formation of animal territories |9]. 


V. CONCLUSIONS 

We succeeded with the application of the above ex¬ 
plained large-deviation scheme to the study of properties 
of convex hulls of random walk models. We could obtain 
the corresponding distributions over broad ranges of con¬ 
vex hull area A and perimeter L, down to densities and 
probabilitiues as small as 10“^™. 

The scaling behavior of these distributions turned out 
to be linked to walk length T in the same way as the 
averages (^) ^ T and (L) ^ Vt. Also, the examination 
of the empirical rate functions resulted in simple power 
laws, which are compatible with the scaling behavior. 
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